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O ' Abstract 

(N' 

We prove that the Bounded Occurrence Ordering fc-CSP Problem is not approximation 
resistant. We give a very simple local search algorithm that always performs better than the 
random assignment algorithm (unless, the number of satisfied constraints does not depend on 
the ordering). Specifically, the expected value of the solution returned by the algorithm is at 
least 

Alg > Avg + a(B, fc)(OPT - Ave), 



Q where Opt is the value of the optimal solution; Avg is the expected value of the random solution; 
-_J -Yd O o ,T>-(k+0(l))\ : J J: 1 /- .f,!.. __j 



and a(B, k) = Clf.(B~( fe +°W) is a parameter depending only on k (the arity of the CSP) and 
B (the maximum number of times each variable is used in constraints) . 

The question whether bounded occurrence ordering /c-CSPs are approximation resistant was 
raised by Guruswami and Zhou (2012), who recently showed that bounded occurrence 3-CSPs 
and "monotone" fc-CSPs admit a non-trivial approximation. 
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Overview. In this work, we give a very simple local search algorithm for ordering constraints 
satisfaction problems that works better than the random assignment for those instances of the 
ordering /c-CSP problem, where each variable is used only a bounded number of times. To moti- 
vate the study of the problem, we first overview known results for regular constraint satisfaction 
problems. 

An instance of a constraint satisfaction problem consists of a set of variables V = {x±, . . . , x n } 
taking values in a domain D and a set of constraints C. Each constraint C £ C is a function from 
D k to M + applied to k variables from V. Given an instance of a CSP, our goal is to assign values 
to the variables to maximize the total payoff of all constraints: 

max > C(xi, . . . , x n ). 

Note, that we write C(x\, . . . , x n ) just to simplify the notation. In fact, C may depend on at most 
k variables. The parameter k is called the arity of the CSP. In specific CSP problems, constraints 
C may only belong to a special family of constraints. For example, in Max Cut, the domain is 
D = {—1, 1}, and all constraints have the form C(xi, . . . ,x n ) = l(xi ^ Xj); in Max 3LIN-2, the 
domain D = {0, 1}, and all constraints have the form C(x±, . . . ,x n ) = © Xj © x\ = 0) or 

C(xi,...,X n ) = l(Xj©Xj©X/ = 1). 
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Various approximation algorithms have been designed for CSPs. The most basic among them, 
the "trivial" probabilistic algorithm simply assigns random values to the variables. It turns out, 
however, that in some cases this algorithm is essentially optimal. Hastad (1997) showed that for 
some CSPs e.g., 3LIN-2 and E3-SAT, beating the approximation ratio of the random assignment 
algorithm (by any positive constant e) is NP-hard. Such problems are called approximation re- 
sistant. That is, a constraint satisfaction problem is approximation resistant, if for every positive 
e > 0, it is NP-hard to find a {A tr i V i a i + e) approximation, where A tr i V i a [ is the approximation ratio 
of the random assignment algorithm. If there exists an algorithm with the approximation ratio 
(Atrivial + e ) f° r some positive e, we say that the problem admits a non-trivial approximation. It is 
still not known which constraint satisfaction problems are approximation resistant and which admit 
a non-trivial approximation. This is an active research question in approximation algorithms. 

Suppose now that in our instance of £>CSP, each variable is used by at most B constraints. (For 
example, for Max Cut, this means that the maximum degree of the graph is bounded by B.) Hastad 
(2000) proved that such instances (which we call B-bounded occurrence fc-CSPs) admit non-trivial 
approximation. Let Opt denote the value of the optimal solution; Avg denote the expected value 
of the random assignment; and Alg denote the expected value returned by the algorithm. Hastad 
(2000) showed that there exists an approximation algorithm such tha10 

Opt - Avg 
Alg > Avg H 0^{B) 

Here the hidden constant in Ok(-) may depend on k. Trevisan (2001) showed a hardness of approx- 
imation lower bound of Avg + (Opt — AvG)/(fifc(\/f?))- 

In this work, we study ordering constraints satisfaction problems. A classical example of an 
ordering &-CSP is the Maximum Acyclic Subgraph problem. Given a directed graph G = (V,E), 
the goal is to find an ordering of the vertices tt : V — > {1, . . . ,n} (jr is a bijection; n = \V\), so 
as to maximize the number of forward edges. In this case, the edges of the graph are constraints 
on the ordering tt. An edge (u,v) corresponds to the constraint ir(u) < ir(v). Another example 
is the Betweenness problem. We are given a set of vertices V and a set of constraints {C UjVjW }. 
Each C UtVtW is defined as follows: C UjVjV1 {it) = 1, if u < v < w or w < v < u, and C UtV)W (iT) = 0, 
otherwise. The goal again is to find an ordering satisfying the maximum number of constraints. 

More generally, in an ordering /c-CSP, each constraint C is a function of the ordering that 
depends only on the relative order of k vertices. The goal is given a set of vertices V and a set of 
constraints C, to find an ordering 7r : V — >■ [n] to maximize the total value of all constraints: 

max > C(tt). 

Here tt is a bijection and n = \ V\. If all constraints take values {0, 1}, then the objective is simply 
to maximize the number of satisfied constraints. Note, that an ordering A;-CSP is not a A:-CSP. 

Surprisingly, we know more about ordering CSPs than about regular CSPs. Guruswami, Hastad, 
Manokaran, Raghavendra, and Charikar (2011) showed that every ordering CSP problem is ap- 
proximation resistant assuming the Unique Games Conjecture (see also (Guruswami et al. 2008) 



x Thc quantity ("value of the solution" — Avg) is called the advantage over random. The algorithm of Hastad 
(2000) is Ok(B) approximation algorithm for the advantage over random: 

Opt — Avg 
ALG " AVG ^ O k (B) ■ 
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and (Charikar et al. 2009)). On the positive side, Berger and Shor (1990) showed that bounded 
degree Maximum Acyclic Subgraph, and thus every bounded occurrence ordering 2CSP, admits 
a non-trivial approximation. Their result implies that Alg > Avg + (Opt — Avg) fO(y/B). 
Charikar, Makarychev, and Makarychev (2007) showed that a slight advantage over the ran- 
dom assignment algorithm can be also achieved for instances of Maximum Acyclic Subgraph 
(Alg > Avg + (Opt — Av~G)/0(logn)) whose maximum degree is not bounded. Finally, Gu- 
ruswami and Zhou (2012) proved that all bounded occurrence ordering 3CSPs admit a non-trivial 
approximation (Alg > Avg + (Opt — Avg) /Ok(B)). They also proved that there exists an ap- 
proximation algorithm for monotone fc-CSP (i.e., ordering CSPs, where all constraints are of the 
form ^(u^) < 7r(uj 2 ) < • • • < 7r(iij fe )) with approximation ratio 1/kl + l/Ok(B). 

Our results. We show that a very simple randomized local search algorithm finds a solution 
of expected value: 

Opt - Avg 
Alg > Avg + Qfc(gfc+2) . 

This algorithm works for every bounded occurrence ordering fc-CSP. Consequently, all bounded 
occurrence ordering fc-CSPs admit a non-trivial approximation. The running time of the algorithm 
is 0(n log n) (or, even, 0{n) if generating a random number from 1 to n takes a unit of time). It 
can be derandomized, and the approximation guarantee can be slightly improved to Avg + (Opt — 
Avg) /Ok(B k l 2+2 ) (using a more complex algorithm; see Appendix). We do not know whether 
the dependence on B is optimal. However, the result of Trevisan (2001) implies a hardness of 
approximation upper bound of Alg + (Opt - Avg) /Q. k {\fB%. 

Techniques. Our algorithm works as follows: first, it permutes all vertices in a random order. 
Then, n times, it picks a random vertex and moves it to the optimal position without changing the 
positions of other vertices. We give an elementary proof that this algorithm performs better than 
the random assignment. However, the bound we get is exponentially small in B. 

Then, we improve this bound. Roughly speaking, instead of the original problem we consider 
the "D-ordering" problem, where the algorithm puts vertices in D ~ Bk buckets (possibly, in a 
clever way), then it randomly permutes vertices in each of the buckets, and finally outputs vertices 
in the first bucket, second bucket, etc. This idea was previously used by Charikar, Makarychev, 
and Makarychev (2007), Guruswami, Hastad, Manokaran, Raghavendra, and Charikar (2011), and 
Guruswami and Zhou (2012). The transition to "D-orderings" allows us to represent the payoff 
function as a Fourier series with relatively few terms. We prove that the L\ weight of all coefficients 
of the payoff function is at least Avg + Ofc(OPT — Avg) (Note, that the optimal value of the "D- 
orderings" problem may be less than Opt). Then we show that (a) for each vertex we can find one 
"heavy" Fourier coefficient fs; and (b) when the original local search algorithm moves a vertex it 
increases the value of the solution in expectation by at least Q^fs/B). This concludes the proof. 



2 Preliminaries 

An instance of an ordering /e-CSP problem (V,C) consists of a set of vertices V of size n, and a 
set of constraints C. An ordering of vertices 7r : V — > {1, . . . , n} is a bijection from 1 to {1, . . . , n}. 
Each constraint C 6 C is a function from the set of all ordering Sy = {it : V — > {1, . . . , n}} to M + 
that depends on the relative order of at most k vertices. That is, for every C there exists a set 

Every fc-CSP can be encoded by an ordering 2fc-CSP by replacing every boolean variable x with two variables 
uj~ and , and letting x = 1 if and only if n(v£~) < -k(u^). 
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Tc C V of size at most k such that if for two orderings tt\ and 7T2, 7Ti(u) < 7i"i(?;) 44> ^(u) < ^(u) 
for u, v 6 Tc, then C(7Ti) = C(7r2). The value of an ordering 7r equals 

value(7r,C) = C(tt). 

Cec 

We will sometimes write value((«i, . . . u n ),C) to denote the value(7r,C) for ir : u-i h-> We denote 
the optimal value of the problem by Opt(V,C) = max l6 6 v value(7r, C), the average value — the 
value returned by the random assignment algorithm — by Avg(V,C) = 1/n! X^eS va hie(vr,C). 

3 Algorithm 

We now present the algorithm. 



Randomized Local Search Algorithm 

Input: a set of vertices V, and a set of constraints C. 
Output: an ordering of vertices (vi, . . . , v n ). 

1. Randomly permute all vertices. 

2. Repeat n times: 

• Pick a random vertex u in V. 

• Remove u from the ordering and insert it at a new location to maximize the payoff. 
I.e., if vi, . . . ,v n -i is the current ordering of all vertices but the vertex u, then find a 
location % that maximizes the value(wi, . . . , i>i-i, u, Uj+i, . . . v n -i,C), and put u in the 
i-th position. 

3. Return the obtained ordering. 



Theorem 3.1. Given an instance (V,C) of a B-bounded occurrence ordering k-CSP problem, the 
Randomized Local Search Algorithm returns a solution 7Talg °f expected value 

■ value(W) > Avo(y,C) + ^ff^ffi* ^ (D 

Remark 3.1. We prove a slightly better bound: AvG(V,C) + (OPT(V,C)-WORST(V,C))/O k (B k+2 ), 
where WORST(y, C) = min^-gg^ value(7r, C) is the value of the worst possible solution. 

Proof. I. We first show using an elementary argument that 

E value(7r A LG,C) > Avg{V,C) + a(B, fc)(OPT(V,C) - Avg(V,C)), 

for some function a(B, k) depending only on B and k. This immediately implies that every bounded 
occurrence ordering fc-CSP admits a non-trivial approximation. Then, using a slightly more involved 
argument we prove the bound (P). 
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Observe, that the expected value of the solution after step 1 is exactly equal to Avg(V,C). 
So we need to estimate how much local moves at step 2 improve the solution. Let A u be the 
maximum possible increase in the value of an ordering ir, when we move u to another position. In 
other words, A u = max 7r + A - (value (ir + , C) — value(7r~, C)), where the orderings 7r + and tt~ differ 
only in the position of the vertex u. Let it* be the optimal ordering, and 7r* be the worst possible 
ordering. We can transition from ir* to 7T* by moving every vertex u at most once. Thus, 

A u > value(vr*,C) - value(vr,,C) = Opt(V,C) - WORST(y,C) > Opt(V,C) - Avg(V,C). 

uev 

Now, our goal is to show that when the algorithm moves a vertex u, the value of the solution 
increases in expectation by at least a(B, k)A u for some function a depending only on B and k. 

Fix a vertex u. Let ir + and ir~ be the orderings that differ only in the position of the vertex u 
such that A u = value(7r + ,C) — value(7r~,C). It may happen that the random permutation chosen 
by the algorithm at step 1 is 7r~, and u is chosen first among all vertices in V at step 2. In this case, 
the algorithm can obtain the permutation ir + by moving u, and thus it can increase the value of 
the solution by A u . However, the probability of such event is negligible. It is 1/n • l/n\. The main 
observation is that the increase in the value of the ordering, when we move u, depends only on the 
order of the neighbors of u i.e., those vertices that share at least one common constraint C G C 
with u (including u itself). We denote the set of neighbors by N(u). Since each vertex participates 
in at most B constraints, and every constraint depends on at most k variables, |-/V(it)| < kB. 

Consider an execution of the algorithm. We say that u is fresh if u was chosen at least once in 
the "repeat" loop of the algorithm, and none of the neighbors were chosen before u was chosen the 
first time. The probability that a variable u is fresh is at least 1 /2\ N(u)\ . Indeed, the probability 
that at least one vertex in N(u) is chosen is 1 — (1 — \N(u)\/n) n > 1 — 1 /e; the probability that the 
first vertex chosen in N(u) is u is l/|iV(n)| (since all vertices in N(u) have the same probability of 
being chosen first). 

If u is fresh, then when it is chosen, its neighbors are located in a random order (since none 
of them was moved by the algorithm). Thus, with probability 1/N(u)\ > l/(kB)\, the order of 
neighbors of u is the same as in 7r~. Then, by moving u we can increase the value of the ordering 
by A u . 

Therefore, when the algorithm moves the vertex u, the value of the ordering increases in expec- 
tation by at least 

Pr(-u is fresh) • Pr(iV(u) is ordered as 7r _ after step 1) • A u = " > 



2\N(u)\ \N(u)\\ ~ kB{kB)V 
This finishes the elementary proof that a positive a(B, k) exists. 

II. We now improve the lower bound on a(B, k). We show that for a fresh variable u, the 
value of the ordering increases in expectation by at least f2fc(l? - ( fc+1 ))A tl , and thus holds. Let 
L = \log 2 {\N(u)\ + l)] and D = 2 L . Consider D buckets [D] = {!,.. .,D}. For every mapping x of 
the vertices to the buckets v *— > x v £ [D], we define a distribution IA X on orderings of V . A random 
ordering from IA X is generated as follows: put each vertex v in the bucket x v ; then randomly and 
uniformly permute vertices in each bucket; and finally output vertices in the first bucket, second 
bucket, etc (according to their order in those buckets). Let C u be the set of constraints that 
depend on the vertex u. Since every variable participates in at most B constraints, \C U \ < B. Let 
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f(x) be the expected total value of constraints in C u on a random ordering tt sampled from the 
distribution U x : 

f(x) = E^ Ux [ £ C(tt)" . 
cec u 

Since the number of buckets D is greater than |-/V(ii)| + 1, we may put every vertex in N(u) 
in its own bucket and keep one bucket empty. Let tt + and tt~ be the orderings as in part I of the 
proof: 7r + and tt~ differ only in the position of the vertex u, and value(-7r + , C) — value(7r _ , C) = A u . 
Consider mappings x + : V — > [D] and x~ : V — > [D] that put only one vertex from N(u) in every 
bucket and such that x+ = x~ for every v ^ u, and x + orders vertices in N(u) according to 7r + , x~ 
orders vertices in N(u) according to 7r _ . For example, if 7r + arranges vertices in the order (a, b, u, c), 
and 7r~ arranges vertices in the order (a,u,b,c), then x + = (04 1, *,b i->- 3, it i->- 4,c i->- 5) and 
x~ = (a i— > 1, u i— > 2, b i— > 3, *, c H >• 5). Since the order of all vertices in iV(u) is fixed by x + and a; - , 
we have f(x + ) = value(7r + , C u ) and f(x~) = value(7r~, C u ). Then 

f(x + ) - f{x~) = value(7r + ,C u ) - value(7r", C u ) 

= value(7r + , C) — value(7r _ , C) = A u . 

We now use Theorem 14.11 which we prove in Section HI Let X v (for v £ V) be independent random 
variables uniformly distributed in [D]. By Theorem 14.11 

E[mnxf{x u ,{X v } v ^ u )-f(X u ,{X v } v ^ u )} > Q k (B- 1 D- k )(f(x + )- f(x~)) 

= Q k (B-^)A u . 

Here, (x u ,{X v } Vf L u ) denotes the mapping u i->- x u and v \-t X v for v ^ u; and (X U ,{X V } V ^ U ) 
denotes the mapping v i— > for all v. 

Observe, that when we sample random variables Xy, and then sample it according to Ux, we 
get a random uniform ordering tt of all vertices in V. Thus, 

E[f(X u ,{X v } v ^ u )}=E weev [ C ( 7T )} =E„[value(7r,C u )]. 

cec u 

Similarly, when we sample random variables X v , set x u = argmax^g^ f(x u , {X v } v ^ u ), and then 
sample it' according to M( Xu ,{X v } V7 l u ), we get a random uniform ordering of all vertices except for 
the vertex u. Denote by LS(ir, u) the ordering obtained from the ordering tt by moving the vertex 
u to the optimal position. It is easy see that if tt is a random uniform ordering, then LS(tt, u) has 
the same distribution as LS(ir' , u), since the new optimal position of u depends only on the relative 
order of other vertices v, and not on the old position of u. Hence, 

E[max f(x u ,{X v } v ^ u )} = E 7r /[value(7r / ,C u )] 

x u £D 

< E 7r /[value(L5(7r / , u),C u )] 
= E w [value(LS(-K,u),C u )]. 



Hence, 



E 7r [value(LS'(7r, u), C) — value(7r, u, C)] = E vr [value(LS'(7r, u), C u ) — value(7r, u, C u )] 

> n k (B-^)A u . 



□ 
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4 Theorem 14.11 



Theorem 4.1. Let D be a set of size 2 L (for some L). Consider a function f : D n+1 — > M. that 
can be represented as a sum of T functions ft ■ D n+l — > R: 

T 

f(x ,xi, ... ,x n ) = ^/ t (x ,xi,. . . ,x n ) 
t=l 

such that each function f t depends on at most k variables x u . Here, x , . . . ,x n G D. Then, the 
following inequality holds for random variables Xq, . . . ,X n uniformly and independently distributed 
in D: 

E[max f(x, X u ...,X n )- f{X , X u ...,X n ))> 

^k(T~ 1 \D\-' k ) max (f(x + ,x 1 ,...,x n )-f(x',x 1 ,...,x n )). 

x+ ,x ,xi,...,x n £D 

Remark 4.1. The variable xq corresponds to x u from the proof of Theorem \3.1\ The functions 
ft{x) are equal to E 7r ^ a ,Ct(7r), where Ct is the t-th constraint from C u . 

Proof. Without loss of generality we assume that elements of D are vertices of the boolean cube 
{— 1, 1} L . We denote the i-th coordinate of x € D by x(i). We now treat / as a function of (n + l)L 
boolean variables x u (i). We write the Fourier series of the function /. The Fourier basis consists 
of functions 

Xs{xo, ■ ■ ■ ,x n ) = Yl x u{i), 
(u,i)es 

which are called character^. Each index S C {0,1, ... ,n} x {1, . . . ,L} corresponds to the set of 
boolean variables {x u (i) : (u,i) G S}. Note, that x<a(, x 0i ■ ■ ■ -, x n) = 1- The Fourier coefficients of / 
equal 

fs = E[f(X ,...,X n )xs(X ,...,X n )}, 

and the function / equals 

f{x ,.. .,x n ) = ^2fsXs{xo, ■ ■ ■ ,x n ). 
s 

Remark 4.2. In the proof, we only use the very basic facts about the Fourier transform. The main 
property we need is that the characters form an orthonormal basis, that is, 

E[ XSl (X ,...,X n )xs 2 (X ,...,X n )} = 

Particularly, for S ^ 0, 

E[ XS (X , ...,X n )}= E[ XS (X , . . .,X n ) X0 (X o , ...,X n )} = 0. 

We will also need the following property: if f does not depend on the variable x u (i), then all Fourier 
coefficients fs with (u, i) £ S are equal to 0. 

3 Of course, \s is a character, because Xs(xoVo, ■ ■ ■ ,x n y n ) = Xs(xo, ■ ■ ■ ,x n )xs(yo, ■ ■ ■ ,Vn)- 



1, ifSi = S 2 ; 
0, ifSi^S 2 . 
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Here is a brief overview of the proof: We will show that the L\ weight of Fourier coefficients 
of / is at least /(x + , xi, . . . , x n ) — f(x~,x±, . . . , x n ), and the weight of one of the coefficients fs* 
is at least £l(T~ 1 \D\~ k (f(x + , xi, . . . ,x n ) — f(x~,x±, . . . , x n ))). Consequently, if we flip a single bit 
X(j(i*) in Xq to make the term fs*Xs* (Xq,Xi, . . . , X n ) positive, we will increase the expected value 
of / by \ f s *\. 

Observe, that since each function ft depends on at most kL boolean variables, it has at most 
2 kL = \D\ k nonzero Fourier coefficients. Thus, / has at most T \D\ k nonzero Fourier coefficients fs- 
Pick x + , x~ , x*, . . . , x* that maximize f(x + ,x\, . . . , x* n ) — f{x~,x\, . . . , x* n ). We have 

f(x + ,X*,...,Xn) - f(x~,X*,...,Xn) = ^ fs(XS (x + , X* X* n ) - Xs{x~ , X* ,-■■,<)) • 

5 

If S does not contain pairs (0, i) corresponding to the bits of the variable xo, then xs(xo, xi, . . . , x n ) 
does not depend on xo, and xs{x + , x \, . . . , x*) — xs(x~,x\, . . . ,x*) = 0, hence 

f(x + ,X*,...,Xn)~ f(x~, x\, ...,<) = ^ f S -(xs(x + ,xl,...,Xn)-Xs(x~,xl,...,Xn)) 

S:3i s.t. (0,i)eS 

< 2 \fs\- 

S:3i s.t. (0,i)eS 

Pick a character fs* with maximum absolute value and pick one of the elements (0, i*) G S*. Since 
the number of nonzero characters fs is at most T\D\ k , 

I ? I ^ /(x~*~, x 1; . . . , x TO ) — /(x , x 1; . . . , x ra ) 
Let a = sgn( fs*). Define a new random variable X' Q on the same probability space as X , . . . , X„, 



X («), for i / i*; 

o-XS*(^o,---,^n)^o(*), for 1 = 1*. 



Consider a character %5. If (0, i*) £ S, then xs does not depend on the bit xo(i*), hence 
E[ XS (X( ) ,X 1 , ...,X n )}= E[xs(X , Xi, . . .,!„)]. On the other hand, if (0,**) G 5, then 



E[xs(^,Xi,...,X n )]=E[^|^xs(^o,^i,--.,^n)] = 



E[<7 X S'(*o, ■ • • , X n ) Xs (X , X 1: ..., X n ) 



0, if 5/ S*; 
a, if S = S*. 



The last equality holds because characters xs form an orthonormal basis. Therefore, 
E[f(X' ,X 1 , ...,X n )- f(X ,X 1 , ...,X n )} = af s * = |/5*|- 

We get 

E[max/(x, X x ,...,X n )- f(X , X 1: ..., X n )] > E[f(X' , X 1: ...,X n )- f(X , Xi, . . . , X n )] = 
|/5*| > ^k(T~ 1 \D\~ k ) max (/(x + ,xi, . . . ,x„) - /(x _ ,xi, . . . ,x„)). 



□ 
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5 Concluding remarks 

We can guarantee that the algorithm finds a solution of value (pQ) with high probability by repeating 
the algorithm Q(B k+2 ) times (since the maxim possible value of the solution is Opt). In the 
Appendix, we give a more complex algorithm with a slightly improved approximation guarantee 
of Avg(V,C) + (Opt(V,C) - AvG(V,C))/O k (B k / 2+2 ). This algorithm can be easily derandomized 
using the method of conditional expectations. 

We note that our local search algorithm works not only for ordering /c-CSPs, but also for 
(regular) /c-CSPs. The algorithm first assigns random values to all variable Xi, and then, n times, 
picks a random i 6 {1, ... ,n}, and changes the value of the variable Xi to the optimal value for 
fixed other variables. The approximation guarantee of the algorithm is Avg(V, C) + (Opt(V, C) — 
Avg(V, Cy)/Ojz d(B), here k is the arity, and D is the domain size of the CSP. The approximation 
guarantee has the same dependence on B as the approximation guarantee of Hastad's (2000) original 
algorithm. The analysis rely on Theorem 14.11 
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A Algorithm with Improved Approximation Guarantee 



In this section, we present an algorithm for bounded occurrence ordering CSPs with a slightly 
improved approximation guarantee. 

> AVC(KC) + ° PT( ^-y C) . (2) 

Theorem A.l. There exists a polynomial time approximation algorithm that given an instance of 
the Bounded Occurrence k-CSP problem outputs a solution of value (0|). 

Proof sketch. The algorithm uses the "bucketing" idea (see (Charikar, Makarychev, and Makarychev 
2007) and (Guruswami and Zhou 2012)). We already used this idea in the proof of Theorem 13.11 
However, this time the algorithm will use buckets directly. The algorithm first finds a mapping of 
vertices to buckets x : V — > [D], then it puts each vertex u in the bucket x u , randomly permutes 
vertices in each of the buckets, and finally outputs vertices in the first bucket, second bucket, etc. 
We need to show how to find a good mapping x. 

Let f(x) be the expected value of the solution for a given mapping x: 



f(x) = E 7r ^Jvalue(vr,C)] = E^ Ux [ £ C(vr,C) 



CeC 

The algorithm maximizes f{x) by picking certain subsets of variables x u (loosely speaking, that 
have large L2 Fourier weight) and setting the values of these variables^- We need to introduce some 
notation to formally describe and analyze the algorithm. Let y* be a partial mapping from V to 
[D] . Think of y* 6 [D] as the value assigned to the variable x v . Denote the set of vertices v for 
which y* is defined by st(y*). Define z = (x \ y*) as z v = y* if v G st(y*); and z v = x v otherwise. 
We let f(x I y*) = f((x \ y*)). The function /(■ | y*) does not depend on the variables x v that are 
"already fixed" in y*. 

The definition of / is similar to the definition we used in the proof of Theorem 13.11 But now 
we compute the total expected value of all constraints, not only those constraints that depend on 
a vertex u. Still, the same argument as in Theorem 0] shows that for a fixed u G V , the L\ weight 
of Fourier coefficients fs with S 3 (u, i) (for some i) is large: 

W u = Yl \fs\>&u- (3) 

S:3i s.t. (u,i)€.S 

We denote the sum above by W u . For each constraint C, let Tc be the set of vertices the constraint 
C depends on. We show that for every u there exists Tc B u of size k, such that the L2 weight of 
Fourier coefficients supported on Tc is at least W u /Ok(B k / 2+1 ). 



4 This approach is somewhat similar to the approaches used in the algorithms of Hastad (2000) and of Guruswami 
and Zhou (2012). In fact, their algorithms can also be used here (since the Li weight of all Fourier coefficients is at 
least Avg(V, C) + Qk(OPT(V,C) — Avg(V,C))). Our algorithm groups characters supported on some set T together 
and picks an assignment which maximizes the sum of these characters, while, loosely speaking, algorithms of Hastad 
(2000) and of Guruswami and Zhou (2012) each time pick an assignment which is "good" only for one of characters. 
This way we get a slightly improved approximation guarantee: Avg(V, C) + (Opt(V,C) - Avg(V, C)) / O k (B kl2+0{1) ) 
instead of Avg(V,C) + (6pt(V,C) - AvG(V r ,C))/O fc (B fc+ ° (1) ). 
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Claim A. 2. Let u be a vertex, y* be a partial mapping V to [D], and g(x) = fix \ y*). Suppose, 
that N(u) n st(y*) = 0. Then, there exists a constraint C 6 C such that 

^ 9s - o k (B k +zy 

S:SCT C X[L] ftV ' 

S^0 

Proof. For each constraint C € C, let h (x) = E, 7rr ^y x C (ir) be the expected value of the constraint 
C for 7r ~ Z/j,.. The constraint C(x) depends only on the position of vertices in Tc, thus h (x) 
depends only on the bits x u (i), where (u,i) G Tc x [L]. Consequently, if a Fourier coefficient /ig is 
not 0, then S C Tc x [L]. By the definition, /(x) = X^ceC ^ \x), and thus /s = Xlcee h^{x). So, 
if / s / 0, then for some C € C, 5 C T c x [L]. Therefore, 



E iM^E E i^i- 

?.t. (u,i)S 

Using ([3]), we get that for some C £ C u , 



S:3ts.t. («,i)eS C*eC u S:ScT c x[L]; 

3i s.t. (u,i)£S 



£ - IcT - ^" 

5:5CT C X[L]; 1 n| 

3i s.t. (n,i)GS 

When we fix the values of variables in st(y*), some of the Fourier coefficients may change. However, 
none of the Fourier coefficients fs for which xs depends on x u (i) may change, since S C Tc 1 x [L], 
and thus SD (st(y*) x [L]) C (N(u)C\st(y*)) x [L] = 0. Hence, gs = fs for all S that contain 
for some i. We get 

E i&i> E ifci- E \h\> w " 



S:ScT c x[L] S:ScT c x[L]; S:S<ZT c x[L]; 

S^0 3i s.t. (u,i)eS 3i s.t. (u,i)&S 



B 



The total number of subsets S C Tc x [L] is at most 2 kL = Ok(B k ). By the Cauchy-Schwarz 
inequality, 

E 



,2 ^ Wl Wl 



S+0 



The next lemma shows how we can iteratively increase the value of /. 



□ 



Claim A. 3. Lety* be apartial mapping V to [D]. Consider a vertexu ^ iV(st(y*)) = U v ^f y »\N(v). 
There exists a partial mapping y** which is an extension of y* with | st(y**) \st(y*)| < k, such that 

9 -g > 



o k (BW2+iy 
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Proof. By Claim IAT21 there exists a set Tq such that 



V 3 2 > Wi 

9s -O k (B k + 2 Y 

S:SdT c x[L} KV ' 

Let 

H x ) = ^2 9SXs(x); 

S:SCT C X[L] 

and let X u be independent random variables distributed uniformly in [D]. For Si 7^ S2, the random 
variables xSi(X) and xs 2 P0 are independent. Thus, we have 

Var W X)] = £ »i>o^y. 

5:5CT C X[L] KV 7 

Therefore, there exists some x* , such that 



o k (B k / 2+i y 

Such x* can be easily found in time O k (\B\ k ), because h depends only on the variables in Tq- We 
set y** = x* v for v £ Tq. It is easy to see that g* = g* z + h(x*). This finishes the proof. □ 

The algorithm computes W v for each vertex v, and sorts all vertices so that W V1 > W V2 > ■ ■ ■ > 
W Vn . Then the algorithm start assigning values to the variables. At every step t, the algorithm 
extends the existing partial assignment to y^ t+1 \ It picks the first vertex u that does not belong 
to iV(st (?/*))), and then extends t/W using Claim [A73l At every step, when a vertex u is chosen, 

(for g^(x) = f(x I y^)) is increased by at least W u /O k {B k l 2+l ). At the same step, at most 
Bk vertices become ineligible for Claim [A3l because the set N(st(y^)) increases by at most Bk 
vertices. So in the end, when N(st(y^)) = V, 

3 (T) > E u Wu J2u^u _ Opt — Avg 

y<3 ~ Bk ■ O k (B k / 2 + l ) ~ O k (B k / 2 + 2 ) O k (B k / 2 + 2 )' 

After iV(st(y( T ))) = V, the algorithm sets values to all yet not assigned variables, so that the 

(T) 

Fourier coefficient g does not decrease (setting one bit at a time). □ 
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